Prepared for submission to JCAP 



The large-scale Quasar-Lyman a 
Forest Cross-Correlation from BOSS 



Andreu Font-Ribera a 6 , Eduard Arnau c , Jordi Miralda-Escude rfc 
Emmanuel Rollinde' , J. Brinkmann^ , Joel R. Brownstein 9 , 
Khee-Gan Lee h , Adam D. Myers* , Nathalie 
Palanque-Delabrouille J , Isabelle Paris fc , Patrick Petitjean 6 , 
James Rich 7 , Nicholas P. Ross 6 , Donald P. Schneider' m and 
Martin White 6 " 



institute of Theoretical Physics, University of Zurich, 8057 Zurich, Switzerland 
^Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA, USA 
c Institut de Ciencies del Cosmos (IEEC/UB), Barcelona, Catalonia 
^Institucio Catalana de Recerca i Estudis Avangats, Catalonia 

e Universite Paris 6 et CNRS, Institut d'Astrophysique de Paris, 98bis blvd. Arago, 75014 
Paris, France 

•f Apache Point Observatory, P.O. Box 59, Sunspot, NM 88349, USA 

^Department of Physics and Astronomy, University of Utah, 115 S 1400 E, Salt Lake City, 
UT 84112, USA 

h Max-Planck-Institut fiir Astronomie, Konigstuhl 17, D-69117 Heidelberg, Germany 
department of Physics and Astronomy, University of Wyoming, Laramie, WY 82071, USA 
J 'CEA, Centre de Saclay, IRFU, 91191 Gif-sur-Yvette, France 

fc Departamento de Astronomfa, Universidad de Chile, Casilla 36-D, Santiago, Chile 
'Department of Astronomy and Astrophysics, The Pennsylvania State University, University 
Park, PA 16802, USA 

"Institute for Gravitation and the Cosmos, The Pennsylvania State University, University 
Park, PA 16802, USA 

"Departments of Physics and Astronomy, 601 Campbell Hall, University of California Berke- 
ley, CA 94720, USA 



E-mail: font@physik.uzh.ch 



Abstract. We measure the large-scale cross-correlation of quasars with the Lya forest 
absorption in redshift space, using ~ 60000 quasar spectra from Data Release 9 (DR9) of 
the Baryon Oscillation Spectroscopic Survey (BOSS). The cross-correlation is detected over 
a wide range of scales, up to comoving separations r of 80 h~ 1 Mpc. For r > 15/i _1 Mpc, 
we show that the cross-correlation is well fitted by the linear theory prediction for the mean 
overdensity around a quasar host halo in the standard ACDM model, with the redshift 
distortions indicative of gravitational evolution detected at high confidence. Using previous 
determinations of the Lya forest bias factor obtained from the Lya autocorrelation, we infer 
the quasar bias factor to be b q = 3.64^Q'Jg at a mean redshift z = 2.38, in agreement with 
previous measurements from the quasar auto-correlation. We also obtain a new estimate of 
the Lya forest redshift distortion factor, f3p = 1.1 ± 0.15, slightly larger than but consistent 
with the previous measurement from the Lya forest autocorrelation. The simple linear model 
we use fails at separations r < 15 h^ 1 Mpc, and we show that this may reasonably be due 
to the enhanced ionization due to radiation from the quasars. We also provide the expected 
correction that the mass overdensity around the quasar implies for measurements of the 
ionizing radiation background from the line-of-sight proximity effect. 

Keywords: large-scale structure: redshift surveys — large-scale structure: Lyman alpha 
forest 
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1 Introduction 

As the most optically luminous objects known in the universe, quasars are used as lampposts 
at high redshift to obtain absorption spectra of the intervening intergalactic medium, as well 
as tracers of large-scale structure. Their absorption spectra blueward of the Lya emission 
line reveal the Lya forest, reflecting the structure in the hydrogen gas density in the inter- 
galactic medium as it evolves through gravitational collapse around dark matter halos in 
which galaxies form (e.g., [1],[2], [3], [4]). 

The large-scale clustering of quasars was measured in the 2dF survey (e.g. [5], [6], [7]) 
and in the Sloan Digital Sky Survey (SDSS, [8], [9], [10]). Both the Lya forest and the quasar 
clustering can be used as tracers of the underlying large-scale mass fluctuations, which are 
thought to have an origin in the initial conditions of the universe. In the linear regime, the 
observed quasar correlation function should be equal to the mass autocorrelation times the 
square of the mean bias factor of the quasar host halos. Recent results from the analyses of 
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data from large-scale surveys have indicated a bias factor that increases with redshift and 
has a value b q = 3.8 ± 0.3 at z = 2.4, and is nearly independent of quasar luminosity (see 
[11], [12]). 

Quasar clustering can also be probed by means of the cross-correlation with other trac- 
ers. The quasar cross-correlation with galaxies was measured by [13], [14] and [15]. These 
studies found that the quasar bias factor has a value near unity, comparable to typical star- 
forming galaxies, at redshift z < 1, but the small samples at higher redshift already indicated 
a larger clustering amplitude. Quasars can also be cross-correlated with absorption systems 
found in the spectra of other quasars. This can be done with the hydrogen Lya forest, with 
a high abundance of absorption features, and also with more sparse metal line systems such 
as the CIV lines, which was recently accomplished by [16]. 

The cross-correlation of quasars with the Lya forest absorption was first searched for 
along the same line of sight of each individual quasar, looking for the impact of the quasar 
ionizing radiation reducing the Lya absorption, which has been designated as "proximity 
effect" or "inverse effect" ([17], [18], [19]). The ionizing radiation emitted by a quasar is 
added to the intensity of the cosmic ionizing background, and the higher than average in- 
tensity in the quasar vicinity implies an increased degree of ionization of the intergalactic 
medium, and therefore a decreased absorption by the Lya forest. Studies of the proximity 
effect have therefore expected a lower than average absorption of the Lya forest near the 
quasar Lya emission line compared to the absorption at the same redshift seen in quasars at 
higher redshift, using this to infer the intensity of the cosmic ionizing background (see [20] 
for a more recent study). These investigations have generally not included in the analysis 
the fact that the intergalactic gas should have higher density near a quasar, because of the 
positive correlation of the quasar host halo with the mass density. In reality, the observa- 
tions of the quasar-Lya absorption cross-correlation should reveal the combined effect of the 
mean overdensity and the additional ionizing intensity around quasars, which substantially 
complicates the theoretical interpretation. 

In general, the quasar-Lya cross-correlation can be measured in redshift space from 
Lya forest lines of sight near another quasar, as a function of the perpendicular and parallel 
components of the separation, a and it, respectively. The effect of a quasar on the Lya forest 
in the spectrum of another nearby quasar was investigated in several papers that examined 
individual quasar pairs, generally separated by a few arc minutes ([21], [22], [23], [24]). These 
observations generally found no evidence for any decrease of Lya absorption near quasars due 
to the excess ionizing radiation. In fact, both in the case of quasar pairs at small separation, 
and in the case of using a larger number of pairs at wider separations, it has been found that 
the Lya absorption is stronger near quasars, rather than weaker ([25], [26], [27], [28], [29]), 
[30]. This result has been attributed to the mean overdensity near a quasar, combined with 
the reduction of the ionizing intensity from the quasar due to both anisotropic emission and 
time variability of the quasars, as discussed by [23], [26] and [27]. These works obtained 
upper limits to the luminosity of the quasars emitted in the perpendicular direction with 
a time delay, although no detailed analysis was done to attempt to model the effect of the 
overdensity and provide a robust interpretation of the data. 

This question can now be investigated with the large sample of quasar spectra provided 
by the Baryon Oscillation Spectroscopic Survey (BOSS) of the SDSS-III Collaboration ([31], 
[32]). The DR9 Catalogue of quasars ([33]) already contains 87822 quasars, with more than 
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60000 of them at z > 2, distributed over 3275 square degrees of the sky. The extensive 
area covered and the large number of quasars makes this sample particularly useful to study 
the quasar-Lya cross-correlation at large scales (i.e., the typical nearest neighbor projected 
separation of ~ 10/i _1 Mpc and larger). 

At these separations, the cross-correlation function induced by the mass over density 
around quasars can be predicted from linear theory, and should be proportional to the product 
of the bias factors of the quasars and the Lya forest and show the expected redshift distortions 
([34], [35]). These bias factors can be independently determined by observations of the 
autocorrelations of the Lya forest and the quasars ([4], [12]), and should agree with the 
measured amplitude of the cross-correlation. 

This paper presents the quasar-Lya cross correlation measured from the quasars in 
the DR9 catalogue of BOSS. The data sample is presented in section 2, and the method of 
analysis in section 3. Linear theory is used to model the mean cross-correlation, and the fits 
to the observational results are presented in section 4, showing that the overdensity effect 
alone adequately matches the results at comoving separations r > 15 /i -1 Mpc for reasonable 
values of the bias factors. Section 5 discusses the expected effect of the ionizing radiation 
of the quasars on the cross-correlation for the quasar luminosities in our sample, and the 
manner that this may impact the results in section 4. Our conclusions are summarized in 
section 6. 

Throughout this paper we use the flat ACDM cosmology, with Q m = 0.281, Uf, = 0.0462, 
h = 0.71, n s = 0.963 and erg = 0.8, consistent with the cosmological parameters from the 
WMAP mission [36]. 

2 Data Set 

The data used in this paper is from the publicly available 9th Data Release (DR9, [37]) of the 
SDSS-III Collaboration ([31], [38], [39], [40], [41], [42], [43]), comprising the first two years 
of observations of the Baryon Oscillation Spectroscopic Survey (BOSS, [32]). The quasar 
sample is a subsample of the catalogue described in [33], while the Lya absorption sightlines 
make use of the products described in [44]. 

We first describe the cuts we apply to the catalogue to select our quasar sample, and then 
the set of quasar lines of sight that we use for the Lya absorption field to be cross-correlated 
with the quasars. 

2.1 Quasar sample 

A total of 87822 quasars are present in the DR9 quasar catalogue ([33]). These quasars were 
targeted for spectroscopy using a complex target selection procedure presented in [45] that 
combines a series of methods described in [46] , [47] , and [48] . The French Participation Group 
(FPG) of the SDSS-III Collaboration verified each of these objects by visually inspecting the 
spectra. A number of estimates of the quasar redshift based on different methods are provided 
in this catalogue. We generally use the redshift obtained using the Principal Component 
Analysis method (Z_PCA in DR9Q), but in Section 4.6 we examine our results for the quasar- 
Lya cross-correlation when using other estimates for the quasar redshifts. 
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Figure 1. Left panel: Distribution of the 61342 quasar redshifts in our sample. Right panel: i-band 
absolute magnitude distribution. 



In order to have a well defined redshift interval in our sample, we use only the 61366 
quasars with a redshift in the range 2 < z q < 3.5. Most of the 26456 quasars discarded by 
this criteria are at a redshift that is too low to be cross-correlated with the observed Lya 
absorption spectra, and the few at z q > 3.5 have too few high-redshift nearby lines of sight to 
yield useful results. Finally, we apply a cut in i-band absolute magnitude, —30 < Mj < —23, 
which reduces the final sample to 61342 quasars. The magnitudes are provided in the quasar 
catalog ([33]), and were computed using a similar cosmological model than the one used in 
this study. The redshift and i-band absolute magnitude distributions of our quasar sample 
are shown in figure 1. 



2.2 Lya sample 

Not all quasars present in the catalogue are useful and free of systematic effects for using the 
Lya forest spectrum: they may be affected by broad absorption lines, they may have too low 
a redshift so that their Lya forest pixels lie in the noisy blue end of the BOSS spectrograph, 
or their continuum may be too difficult to model. For these reasons, we select the same 
quasars as in [49], a previous study of the cross-correlation of the Lya forest with Damped 
Lya Systems (DLAs) using the DLA catalogue of [50]. This reduces the number of available 
lines of sight to 52449. The selection criteria are that the quasar redshift lies in the range 
2.15 < z < 3.5, that the BAL_FLAG_VI flag is not set in the catalogue of [33], and that 
the median signal-to-noise ratio per pixel (of width ~ 69kms _1 ) in the quasar rest-frame 
wavelength range 1220 A < A r < 1600 A is S/N > 0.5. 

We define the Lya forest region as the rest-frame wavelength range 1041 A < A r < 
1185 A, and use the 'value-added' co-added spectra made publicly available with DR9 ([44]), 
removing any pixels with the bit mask or the sky mask set, as defined in [44]. We also correct 
the pipeline estimate of the noise variance using the recipe described in [44] . 

Lines of sight that include no more than one detected DLA from the "DLA concordance 
catalogue" in [51] are included in the analysis, after masking the central region of the DLA 
absorption line and correcting for the damped wings outside this central region, as explained 
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in [44]. Since the catalogue of [51] is not complete (especially in low S/N spectra), some 
residual contamination of DLAs is expected. 



3 Method 

The method we use to measure the quasar-Lya cross-correlation is the same one as for 
the cross-correlation of Damped Lyman Alpha systems (DLAs) with the Lya absorption, 
described in [49]. Here, a brief summary of the method is presented, discussing in detail only 
the issues that are special for our analysis of the quasar cross-correlation and any differences 
with respect to the method used for DLAs. 

The observed flux in each pixel of the quasar spectra is = F(zi) [1 + 5Fi] + Ni, 
where Cj is the quasar continuum (equal to the flux that would be observed in the absence of 
absorption), F(zi) is the mean transmitted fraction, 5f% is the Lya transmission fluctuation, 
and Ni is the observational noise, which is assumed to be uncorrelated in all pixel pairs. 
To estimate 6 Ft and its cross-correlation we must first model the continua of the quasars. 
This is done using the PCA technique described in [52], but without applying the Mean Flux 
Regulation described in this paper, which can suppress the large-scale power in the Lya forest 
in ways that are difficult to model. In the same way as in [49], we generally apply instead the 
Mean Transmission Correction (hereafter MTC), which enforces the mean transmission in 
each quasar spectrum to be equal to the value measured in independent observations by [53] , 
using equations (3.5) and (3.6) in [49]. This correction is useful to remove the broadband 
noise caused by spectrophotometric errors, but we will also show results when no correction 
to the quasar continua is applied. We will see in section 4 that including the MTC increases 
the accuracy of the measured quasar bias in our parameterized model by ~ 30%. However, 
this correction distorts the cross-correlation function and the fitted theoretical model must 
be corrected to take this effect into account (see Appendix A in [49]). 

The mean transmitted fraction F is measured here in 150 bins of Az = 0.01 over the 
range 1.9 < z < 3.4 (no Lya forest data is used outside this range). These redshift bins are 
three times smaller than the ones used in [49] for the same purpose. The measured F(zi) 
has fluctuations when using small redshift bins owing to systematic errors in the calibrating 
reference stars ( [54] , [44] ) , and we found that the fine redshift bins are necessary to correctly 
eliminate the effect of these fluctuations. In general, the larger number of available quasars 
compared to DLAs allows for a more accurate measurement of the cross-correlation for the 
quasars, and therefore greater care needs to be taken in the analysis for quasars. 

The cross-correlation is computed with the simple estimator (see Appendix B of [49] for 
a discussion of the approximations involved and the differences with an optimal estimator): 



where the summation is done over all quasars and over all Lya pixels that are within a bin 
(A) of the separation from each quasar in the perpendicular (cr) and parallel (tt) directions, 
Sf% is the estimated value of the transmission fluctuation from the observed flux and the 
continuum model, and the weights Wi are computed independently at each pixel from the 
noise Ni provided for the DR9 data [38], assuming a model for the intrinsic Lya absorption 
variance that is added in quadrature to the noise (equation 3.10 in [49]). 
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A set of 9 bins in a and 18 bins in tt are used to measure the cross-correlation, 
which are the same ones as in [49] except that we add the extra bin at large separations 
60 /i -1 Mpc < a < 80 /i^ 1 Mpc, and the same for both signs of ir (the cross-correlation is 
measured without assuming symmetry with respect to a sign change of the parallel separa- 
tion 7r). This procedure yields a total of 162 bins. The weighted average values of (tt, a) of all 
the contributing pixel pairs to every bin A are computed together with the cross-correlation 
from equation (3.1), using the same weights. These averages are generally close but not 
exactly equal to the central values of each bin. The models to be fitted are evaluated at these 
weighted averages of (cr,ir). A single redshift bin is generally used for the mean redshift z 
of the Lya forest pixel and the quasar, which is required to be in the range 2.0 < z < 3.5, 
although some results are also presented in the next section using three redshift bins for the 
purpose of testing redshift evolution. 

The errors of the cross-correlation are computed in two different ways: 1) The covariance 
matrix is estimated assuming Gaussianity and a model power spectrum for the Lya forest 
intrinsic autocorrelation. 2) Our quasar sample is divided into twelve subsamples in adjacent 
areas of the sky, and the cross-correlation is computed separately in each subsample to infer 
bootstrap errors. The method used and the subsample areas are the same as in [49]. The 
covariance matrix is computed including only pixel pairs up to a transverse separation a < 
5 Mpc in order to make the computer time required for the calculation more manageable. 
Most of the contributions to the covariance comes from pixel pairs in the same line of sight 
(a = 0). 

We use the same linear theory model described in Section 3.6 of [49] (their equation 3.16), 
with bias parameters b q and &f, and redshift distortion parameters (5 q and /3f, for the quasars 
and the Lya forest, respectively. The quasar redshift distortion parameter obeys the relation 
(3 q = f(£l)/bq, and for the Lya forest we impose the condition bpil + f3p) = —0.336 from the 
observational result of [4] obtained from the measured Lya autocorrelation at z = 2.25. We 
also impose that (Hp and b q are constant with redshift and bp oc (1 + z) 2 ' 9 , as discussed in [4]. 
However, whereas the effect of redshift errors for DLAs could be neglected, this is not the 
case for quasars. We therefore add two extra free parameters to the model: a dispersion e z 
and a mean offset A z of the quasar redshift error, assuming these errors to be Gaussian. The 
theoretical model for the linear cross-correlation is smoothed with a Gaussian in the parallel 
direction with this dispersion and offset. 

Fits to this model are generally done using only separation bins at r = (a 2 + 7T 2 ) 1 / 2 > 
15 h^ 1 Mpc in order to avoid the near region that is possibly affected by radiation and non- 
linear effects, although some fits will also be shown using all bins at r > 7/i Mpc. The 
theoretical prediction is corrected for the MTC using the equations explained in Appendix 
A of [49]. 

4 Results 

4.1 Fiducial model 

The results of the cross-correlation of the Lya transmission with quasars, with the method 
presented in section 3, are shown in figure 2 for each bin in the transverse separation a. The 
error bars are the diagonal elements of the computed covariance matrix, which we have found 
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Figure 2. Measured cross-correlation in the indicated bins of perpendicular separation a, as a 
function of the parallel separation ir. The data points in green have a total separation r = (a 2 + 
tj.2) 1 / 2 < 25 h~ x Mpc, and are not used in most of our fits. Solid (dashed) dark (blue) lines show the 
best fit model for the fiducial analysis, when using bins with separations down to r — 15 /i -1 Mpc 
(r = 7/i- x Mpc). 



to be consistent with the bootstrap errors from the scatter of the cross-correlation function 
measured in subsamples. Two model fits to the data are also plotted in figure 2: the solid 
(black) line uses only bins with separation r = (a 2 + 7T 2 ) 1 / 2 > 15 /i -1 Mpc, and the dashed 
(blue) line uses all the bins with r > 7 ' h~ l Mpc. The data points in green have a separation 
r < 15 hT x Mpc and are not used in our main analysis. The same results are shown as contour 
plots in the left panel of figure 3, with our best fit fiducial model, the one using only the 
r > 15/i _1 Mpc bins, shown in the right panel. 

Our fiducial model fit applies the MTC to the data and includes the corresponding 
correction to the theory, and uses the whole sample of 61342 quasars described in section 2 
with the PCA quasar redshifts. The model has four free parameters, as described at the end 
of section 3: /3p (the Lya redshift space distortion parameter), b q (the quasar bias factor), 
e z (the rms of the quasar redshift error distribution), and (the mean offset of the quasar 
redshift), while the Lya forest bias parameter bp is computed for each value of /3p using the 
well-constrained quantity bp{\ + ftp) = —0.336 at z = 2.25 from the Lya autocorrelation 
result of [4]. The first row in table 1 gives the parameters for the best fit result (in the sense 
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Figure 3. Two dimensional contours of the measured cross-correlation (left panel), compared to 
the best fit theoretical models for r > 15/i _1 Mpc (right panel). The black circle corresponds to 
r = 15 /i" 1 Mpc. 

of minimum value of x 2 ) using the covariance matrix as described earlier, with the value of 
X 2 and the number of degrees of freedom given in the last column. Errors correspond to the 
contours of Ax 2 = 1, after marginalizing over all other parameters. In the case of e z , only 
an upper limit is provided when e z = is within the Ax 2 = 1 contour. The parameters 
for other models that will be described below are given in the additional rows of the table. 
The variables related to quasar redshift errors, e z and A 2 , are expressed in units of kms~ , 
reflecting the directly-measured separation along the line of sight. 

Our basic result, seen in the figures and table 1, is that the simple linear theory model for 
the cross-correlation of quasars as tracers of the mass distribution and the Lya forest provides 
an excellent fit to all the data at large scales. Moreover, the predicted redshift distortions are 
an excellent match to the observed cross-correlation, as seen in figure 3, confirming the large- 
scale mass inflow toward the quasar host halos expected from the gravitational evolution 
of density perturbations. The quasar bias factor required to match the cross-correlation is 
b q = 3.64^{] 'jj, in excellent agreement with the independently-determined bias factor from 
the quasar auto-correlation, b q = 3.8 ± 0.3, from [12]. The redshift distortion parameter of 
the Lya forest is found to be j3p = I.I^q'jT, also in good agreement with the measurement of 
[4]. Finally, we find that the best match of the quasar redshift error distribution requires a 
significant mean offset of A z = — 160 km s -1 , with the negative sign indicating that the PCA 
quasar redshifts are on average too small (so the cross-correlation seen in figure 3 is shifted 
to a redshift higher than that of the quasar), and a surprisingly small error dispersion of 
e z < (370, 490) kms" 1 at the x 2 = (1> 4) confidence levels. Note that e z is the combination of 
the observational error of the quasar redshift and the intrinsic velocity dispersion of quasars 
with respect to their host halo. We need to keep in mind, however, that this upper limit is 
obtained using only the pixels at r > 15 h~ l Mpc, and that the value of e 2 has degeneracies 
with slight modifications of our simple four-parameter fiducial model. 

This degeneracy is well illustrated in our model by the x 2 contours for the parameters 
j3 f and e z , shown in figure 4 (right panel). The quadrupole moment of the cross-correlation 
is determined mostly by these two parameters. The quadrupole moment increases with /3f 
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Table 1. Best fit parameters and x 2 f° r the different analyses: FIDUCIAL (with the MTC and 
the corrected theory, using r > 15 /i -1 Mpc), r > 7/i~ 1 Mpc (extending to smaller scales), NOCOR 
(MTC, uncorrected theory), NOMTC (PCA-only continuum fitting, uncorrected theory), data split 
in rcdshift bins (LOW-Z for 2 < z < 2.25, MID-Z for 2.25 < z < 2.5, HIGH-Z for 2.5 < z < 3.5), 
data split in quasar absolute magnitude (LOW-L for —25.2 < Ms < —23, MID-L for —26.1 < 
Mi < —25.2, and HIGH-L for —30 < Mj < —26.1) and finally different quasar redshift estimates 
(Z_VISUAL,Z_PIPELINE,Z_CIV, Z.CIII,Z.MgII). Uncertainties correspond to values with A X 2 = 1, 
with upper limits for e z when A% 2 < 1 at e z = 0. 




Figure 4. Contours of x 2 in the two-parameter plane of the Lya forest rcdshift distortion parameter 
versus: quasar bias (left), and quasar redshift error dispersion (right). The number of degrees of 
freedom is 130. 



and decreases with e z , but the effect of e 2 is obviously important only at small radius (the 
dependence of the quadrupole moment on the quasar bias factor is relatively small because 
of the requirement /3 q = f(Cl)/b q , which implies a small value for /3 g ). A modification of our 
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model at small radius owing to the radiation effects of the quasars or non-linearities may 
modify the best fit values of e z and A 2 . The left panel of figure 4 also demonstrates that the 
quasar bias b q tends to slightly increase with increasing (3p for a given analysis. 

The value of x 2 = 116 for our fiducial model fit for 130 degrees of freedom, and the 
agreement of the fitted parameters with other independent determinations, shows that the 
observed cross-correlation is sufficiently well reproduced without the need to include the effect 
of the quasar ionizing radiation. Clearly, the linear overdensity around the quasar host halo 
is the dominant effect when measuring the quasar-Lya cross-correlation away from the line 
of sight, on large scales and for the luminosities of the BOSS quasars. 

The second row of table 1 gives the best fit parameters when all the bins at separations 
down to r > 7/i _1 Mpc are used. The best fit shifts to a larger redshift distortion parameter 
of the Lya forest, a lower quasar bias factor, and a larger quasar redshift error dispersion. The 
X 2 worsens significantly, with an increase of 48 when adding only 22 degrees of freedom. This 
result suggests that our 4-parameter model does not include all the important physical effects 
when analyzing the entire range of separations in figure 2. Non-linearities and radiation effects 
are likely to play a role at small separations, and a more complex analysis will be required 
to discern this. 

4.2 Bootstrap errors 

In general, we have tested that the bootstrap errors computed as described in section 3 are 
in agreement with the errors derived from the covariance matrix. We mention here as an 
example this error comparison for the quasar bias factor, when keeping the other parameters 
fixed to their best fit value of our fiducial analysis. The covariance matrix yields errors from 
the A% 2 = 1 contour of b q = 3.634^Q'}gg, and the result of 100 bootstrap realizations from 
the 12 subsamples of our data set (see [49]) is b q = 3.657 ± 0.135, in perfect agreement with 
the standard analysis. In the rest of the paper we use the uncertainties set by the contours 
of Ax 2 = 1. 

4.3 Scale dependence of the quasar bias factor 

In figure 5 we present the value of the quasar bias that is obtained when using only bins 
within narrow rings of the separation r. For this analysis, we fix the other parameters to 
their best fit value when using the range 15 h' 1 Mpc < r < 100 h" 1 Mpc, which are j3 F = 1.1, 
e z = 2.42 h~ l Mpc, A z = — 1.57/i _1 Mpc. The horizontal lines in the figure show the best fit 
and uncertainties when using all separations above r > 15/i _1 Mpc, i.e., all the points lying 
to the right of the dotted vertical line. 

The constancy of the bias factor for r > 15 h~ l Mpc is again a success of the simple linear 
theory model for large scales, meaning that the radial dependence of the cross-correlation 
agrees with the prediction of the standard ACDM model of structure formation. The smaller 
value of b q at smaller separation confirms our previous conclusion that other effects are likely 
to be important at r < 15 h^ 1 Mpc. 

The amplitude of the cross-correlation is proportional to bpb q a%, this is the actual 
quantity we are measuring, and the value for b q plotted in figure 5 assumes that 6f(1 + f3p) = 



- 10 - 



4.4 
4 
3.6 

-Q 2.8 



1 1 1 1 — 1 — 1 — 1 


1 


1 1 1 1 1 1 1 — 






M 


1 1 1 1 1 1 1 


1 





10 100 
r (Mpc/h) 



Figure 5. Fitted QSO bias b q in several bins of the separation r, when fixing the other parameters 
to their best fit value in the fiducial model, /3 F = 1.1, e z = 2.42 h^ 1 Mpc, A z = -1.57 h^ 1 Mpc. 
The horizontal lines show the best fit and uncertainties when using all separations larger than r > 
15 h^ 1 Mpc, the scale that is marked with a vertical dotted line in the figure. 



—0.336 at z = 2.25 (corrected to the mean quasar redshift z q = 2.38, where bp(l + Pf) = 
-0.376), and a 8 = 0.8. 

4.4 Impact of the Mean Transmission Correction (MTC) 

Our fiducial model uses the Mean Transmission Correction (MTC) as part of the continuum 
fitting, and corrects the theoretical model accordingly by using the analytical expression 
derived in appendix A of [49]. As a test of the importance of this correction, table 1 gives 
the results for two additional models: the NO COR case (fourth row) treats the observations 
in the same way as the fiducial model (applying the MTC to the data), but does not correct 
the theoretical model. The result is a considerably worse fit (A% 2 = 26), confirming the 
validity and the need for the theoretical correction. In the NOMTC case, the data for 
the transmission fluctuation is obtained with the direct use of the PCA continua, without 
applying the MTC. The uncorrected theory then fits the data properly, but the errors of the 
best fit parameters are considerably worse, supporting our reasons to use the MTC. An even 
smaller upper limit for e z is obtained for the NOMTC case, indicating that this upper limit 
is questionable because of its degeneracy with other model variations. 

4.5 Evolution with redshift and variation with quasar luminosity 

We test here for the dependence of our measured quasar bias on the redshift and quasar 
luminosity. In table 1 we show the results obtained when splitting the quasar sample in three 
redshift bins: LOW-Z (2.00 < z < 2.25), MID-Z (2.25 < z < 2.50) and HIGH-Z (2.50 < z < 
3.50). There is no clear evidence of evolution in the quasar bias parameter b q , although our 
errors are large and the redshift range that is probed is limited. 

In the same table we show the results obtained when splitting the sample in three 
luminosity bins: LOW-L for -25.2 < M; < -23.0, MID-L for -26.1 < M t < -25.2, and 
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HIGH-L for —30.0 < Mi < —26.1. Again, the changes are not significant, and unfortunately 
the quasar redshift error distribution may vary with the quasar luminosity, making it difficult 
to search for any physical dependence of the cross-correlation with quasar luminosity because 
of the parameter degeneracies. 

4.6 Variation with the quasar redshift estimator 

There are six different quasar redshift estimators specified in the DR9 quasar catalog ([33]). 
In the main part of this study we use the PCA redshift estimator (Z_PCA in DR9Q), but 
we also show in table 1 the results obtained when using the other ones: the visual inspection 
redshift (Z_VI), the estimator from the BOSS pipeline (Z_PIPELINE), and three estimators 
that use a single metal absorption line (Z_CII,Z_CIV,Z_MgII). 

Table 1 shows the best fit parameters for each of the quasar redshift estimators, and 
the quasar bias obtained is consistent among them. However, the redshift error distribution 
varies considerably for the different estimators. Our fits suggest that all the estimators 
systematically underestimate the quasar redshift by several hundreds of kms -1 , except for 
the one based on the Mgll line which is basically consistent with A z = 0, but has a larger 
dispersion e z . 

We caution, however, that these values of A z and e z are likely to change once our 
simple four-parameter model is improved with more parameters to include the quasar ionizing 
radiation effects. The systematic negative value of A z might be adjusting a real asymmetry of 
the cross-correlation introduced by the reduction of the effective quasar luminosity with the 
time delay. The effect of the time-delayed quasar ionizing radiation on the cross-correlation 
should obviously have a different radial dependence than a simple shift of the quasar redshifts, 
but some degree of degeneracy can be expected. At the same time, it seems difficult to believe 
that the true dispersion e z , arising from both observational errors and the intrinsic velocity 
dispersion of quasars within halos, is smaller than ~ 500 kms" 1 , and its value will also 
probably be modified in a more complex model that better reflects the underlying physical 
effects. 

5 Effects of the Quasar Radiation 

As we have seen in the previous section, the large-scale form of the quasar-Lya cross- 
correlation is consistent with the linear overdensity expected around the quasar host ha- 
los. This result may be surprising because of the previous detection of the proximity effect 
when measuring this cross-correlation along the line of sight, caused by the increased ion- 
ization of the intergalactic medium induced by the quasar ionizing radiation, although the 
BOSS quasars are of lower luminosity than most of the quasars on which the proximity effect 
was measured and therefore the expected additional ionization is much weaker in our case 
compared to previous studies. In this section we present a simple estimate of the expected ra- 
diation effect on the full three-dimensional cross-correlation with our quasar sample. A more 
detailed analysis involving fitting of a more complex model that includes both the radiation 
and overdensity effects is left for a future study. 
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The radiation of a quasar of luminosity L v increases the photoionization rate of gas at 
a proper distance d by the amount 

1 [°° Lyomiy) 



"HI 



where ujji is the hydrogen Lyman limit frequency, aui is the photoionization cross section, 
and h is the Planck constant. We neglect here the absorption by Lyman limit systems and 
the redshifting of the radiation, which reduces the quasar intensity when d is comparable 
to the absorption mean free path or the local horizon. Let the relative fluctuation of the 
photoionization rate relative to its average value Tq be 5r- In general, <5r can be affected by 
many sources. Neglecting the effects of ionizing source clustering (which increases the mean 
ionizing flux near a quasar beyond that emitted by the quasar itself), assuming a quasar 
spectrum L u oc v~ a , and approximating anii^) oc i/~ 3 at v > ujji, the average value of 5? 
near the quasar is 

6 Tq _ L VH1 a m (y m ) 

r r ~ 47^2/^0(3 + 0) ' { ' 

The impact of the perturbation 5r on the Lya forest can be calculated using the ap- 
proximation that the gas is purely photoionized (neglecting any contribution from collisional 
ionization) and that the neutral fraction is everywhere much smaller than unity. In this 
case, the optical depth at any point in the spectrum of the Lya forest is inversely pro- 
portional to the photoionization rate, so the fractional perturbation in the optical depth is 
5 T = 1/(1 + 6r) — 1 — — 5r 5 where the last approximate equality assumes 5r <C 1. Now, 
let Fq = e~ T ° be the Lya transmission fraction when the photoionization rate has the uni- 
form value To, with a distribution P(Fq). The transmission in the presence of radiation 
fluctuations is F = e -r °( 1-<Sr ), and the mean transmission is 

F= f dF P(F )e- To( - 1 - 8r) (5.3) 
Jo 

~ f dF P(F ) F (l + T 5 r ) 
Jo 

~F -5 T [ dF P(F ) F log(Fo) . 
J 

We can now define the radiation bias factor of the Lya forest, b-p, as the linear variation of 
the mean transmission fluctuation dp in response to a fractional variation 6r in the radiation 
intensity, analogously to the density and peculiar velocity gradient bias factors. Hence, near 
a quasar the mean transmission fluctuation will vary by 5f = fer^r owing to the fractional 
radiation perturbation Sp, where 

1 f 1 

br = _ / dF P(F )F ln(F ) . (5.4) 
r Jo 

If we use as the transmission distribution a log-normal function in the optical depth, 
with mean Fq = 0.8 and dispersion ap = 0.124, which is close to the observed distribution 
at z = 2.3, we find &r = 0.13. 

We now estimate the average intensity produced by the quasars in our sample. The 
mean flux per unit frequency of a quasar of magnitude g is f v = 10 _0 ' 4 ( 48 ' 6+9 ), expressed 
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in cgs units ([40]). The mean value of f v we obtain for our quasar sample is f u = 2.4 x 
10~ 28 ergs~ 1 cm -2 Hz -1 at the center of the (/-band (at 4800 A; this corresponds to a g magni- 
tude of 20.5) At the mean redshift z q = 2.38 of our quasar sample, the implied mean luminos- 
ity at the shifted g-band center A = 1420 A is L u = knD 2 L f u /{l + z q ) = 3.1 x 10 30 ergs -1 Hz" 1 . 
We assume a mean spectral slope from this wavelength to the Lyman limit wavelength 
L v oc v~ l ([55], [56]) , which results in L UHI = 2.0 x 10 30 ergs -1 Hz -1 . The correspond- 
ing quasar flux at a characteristic comoving distance of interest of d(l + z q ) = 20 h~~ l Mpc, 
or proper distance d = 8.33 Mpc, is then f VHI = 2.4 x 10 -22 ergs -1 cm~ 2 Hz -1 . Finally, using 
equation (5.2) with the spectral index a = 1.5 for v > vhi, the derived photoionization rate 
is T q = 5.1 x 10- 14 s- 1 . 

If we assume a mean photoionization rate from the average of all sources Tq = 10 -12 s _1 , 
then at this comoving distance of 20 h~ l Mpc, we have 5r — 0.05, and the mean perturbation 
on the transmission should be Sf — b^Sr — 0.0065. In the absence of any absorption effects, 
this perturbation should vary in proportion to d~ 2 . 

Comparing to figure 3, we see that this radiation effect should displace the value of the 
cross-correlation by one contour at r = (a 2 +7T 2 ) 1 / 2 = 20 h -1 Mpc. In the third panel of figure 
2 (for the range 7 ft -1 Mpc < a < 10/i _1 Mpc), the increase of the cross-correlation would 
be ~ 0.03 at small ir, which is larger than the difference between the data points and our 
fiducial model fit, and in the fifth panel (for the range 15 h^ 1 Mpc < cr < 20 /i" 1 Mpc), the 
increase would be ~ 0.008. These changes would significantly affect our fiducial fit, implying 
a substantial increase of b q in order to compensate for the radiation effect to a value b q ~ 5, 
which would disagree with the autocorrelation measurement of [12]. We therefore suspect 
that the quasar radiation effect is reduced relative to this simple estimate. 

The effect of the ionizing radiation is not expected to be isotropic around the quasar 
in redshift space. Quasars are likely to emit their light anisotropically, with lower intensity 
near the plane of the accretion disk around the black hole. A flux limited sample of quasars 
preferentially selects sources with their axis (the direction of brightest emission) near our line 
of sight, so on average the flux from quasars in the perpendicular direction should be lower 
than in the parallel direction. Time variability of the quasars also implies that the mean 
quasar ionizing flux affecting the Lya forest depends on the time delay, ctd = r + ir: the Lya 
forest is illuminated by the luminosity of the quasar at a time before the epoch when we 
are observing. The selection effect again causes the average quasar in a flux-limited sample 
to have lower luminosity in the past compared to the present, and this should introduce an 
asymmetry depending on the sign of ir. A hint of this signature of the radiation effect is seen 
in figure 2, in the region ir ~ —20 h" 1 Mpc and a < \tt\, which is consistent with the increase 
of 5f — 0.007 we have estimated at r = 20 h~ l Mpc from the radiation effect. If this hint is 
correct, quasars might shine at close to the expected luminosities within ~ 45° of the line 
of sight and for time delays tj, < 10 years, and have effective lower luminosities at larger 
angles from the line of sight and for longer time delays. 

In any case, obtaining solid conclusions on the contribution of the radiation effect to 
the cross-correlation and the statistical significance of any detection requires a more detailed 
modeling with more fitting parameters, additional data and a more careful inclusion of all 
the important effects. We plan to perform this study in the future when the BOSS survey is 
completed. We mention here, however, that the radiation effects we have discussed may be 
altering the value of the linear model parameters we have fitted. In particular, the redshift 
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offset A z may in part be the result of the attempt to adjust the radiation and time-delay 
effect which introduces the asymmetry depending on the sign of ir. 

5.1 Predictions for the proximity effect 

Our fitted linear theory model for the mean overdensity around a quasar can be used to make 
a prediction on the impact of this overdensity on any measurements of the proximity effect of 
quasars on the line of sight due to their ionizing radiation. In figure 6, we show the prediction 
for the line of sight cross-correlation, as 7r£(cr = 0, tt), for our two first models in table 1: 
1) the fiducial model fit using pixels at r > 15/i _1 Mpc (solid, black line), and 2) the same 
model using all pixels at r > 7 h~ l Mpc (dashed, blue line; in this subsection we choose the 
convention that tt is positive even though for the three-dimensional discussion tt is negative 
in front of the quasar). Also shown as the red, dotted line is the expected radiation effect 
in our sample of quasars, which we have plotted according to our simple estimate above as 
£ = 0.0065(20 hr x Mpc/vr) 2 . 
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Figure 6. Cross-correlation function along the line of sight, i.e., using pixels and their background 
quasar. The black solid (blue dashed) line shows our best fit theory for r > 15 hr 1 Mpc (r > 
7 hr 1 Mpc). The dotted red line shows the expected radiation effect. 

As shown in this figure, the overdensity effect should roughly cancel the radiation effect 
at 7r ~ 10 /i" 1 Mpc for the luminosities of typical BOSS quasars. Most studies of the proximity 
effect have used brighter quasars to obtain higher signal-to-noise spectra, but figure 6 provides 
a correction that should be applied to any future measurements of the proximity effect due 
to radiation under the assumption that the quasar bias factor does not depend on quasar 
luminosity. For tt > 20 /i" 1 Mpc, the overdensity effect causes an increased 5f (decreased 
absorption) that is therefore added to any radiation effect, because the peculiar velocity 
gradient due to the infall of matter toward the quasar halo is a more important effect than 
the overdensity. 
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6 Conclusions 



We have measured the cross-correlation of quasars and the Lya forest transmission in redshift 
space using more than 60000 quasars from BOSS. This unprecedentedly large number of 
quasar spectra for such a study has allowed a statistically significant detection of this cross- 
correlation out to separations of ~ 80 h~ l Mpc, and a detailed determination of its radial and 
angular dependence. The cross-correlation is consistent with the linear theory prediction of 
the standard ACDM model, with the expected redshift distortions, on scales r > 15 hr 1 Mpc. 
Fitting these large-scale measurements to a linear model with four parameters, we find that 
the BOSS quasars at z q = 2.38 have a mean bias factor b q = 3.64i ; 15 . This result is 
consistent with the quasar bias measured from the auto-correlation of a sub-sample of the 
same quasars, b q = 3.8 ± 0.3, as presented in [12]. The halo mass having this bias factor is 
Mh ~ 3 x 10 12 h~ l M & . This measurement can be compared to the halo mass corresponding 
to the mean bias factor inferred for DLAs, the other population of objects for which the bias 
factor was measured from the same approach measuring the cross-correlation with the Lya 
forest in BOSS, which was ~ 4 x 10 11 /i~ 1 M , although a large scatter in halo masses is 
expected for both populations of objects and only the mean bias factor is measured. We do 
not detect any dependence of quasar bias on luminosity or redshift, also consistent with the 
results from [12]. 

The large-scale fit to the cross-correlation also provides a measurement of the Lya 
forest redshift distortion parameter, ftp = Llj^'jI. This value is ~ 1.5<t higher than the 
one measured in [4], reducing the tension with previous numerical simulations ([57]). The 
value of (3f measured from this cross-correlation has the advantage of being less sensitive 
to systematic errors in the spectrophotometric calibration, although it should be equally 
affected by the presence of DLAs, Lyman limit systems and metal lines in the spectra which 
tend to decrease /3f ([58]). 

The cross-correlation at scales r < 15 /i -1 Mpc is not well fitted by the simple linear 
theory model we have used, because its amplitude is lower than expected in the model fitted 
to large scales, as seen in figure 5. We have argued that a likely explanation is the effects of 
the ionizing radiation of the quasars, which are of the right order of magnitude to explain this 
discrepancy for the luminosity of the BOSS quasars. If all the BOSS quasars emitted their 
light isotropically and with constant luminosities, the enhanced ionization of the surrounding 
medium would also affect the cross-correlation we have fitted at r > 15 h~ l Mpc, implying 
a higher quasar bias factor to compensate for this ionization effect. However, the quasar 
radiation effect is likely to be decreased owing to anisotropic emission and finite quasar 
lifetimes. 

The impact of the quasar ionization can also be studied by means of the line of sight 
proximity effect, and using the cross-correlation measured from especially targeted quasar 
pairs at smaller angular separations than in the BOSS sample. In the future, these studies 
will need to model the superposed effects of the mean overdensity around the quasar host 
halos and the ionization effects. Here, we have presented predictions from our fitted models 
for the correction that needs to be applied to the line of sight proximity effect for the mean 
overdensity around quasars, before attempting to infer anything about the ionization effect 
of the quasar. This correction has been neglected in the past, and is in fact small for the most 
luminous quasars that have been used to study the proximity effect, but becomes important 
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for quasars of luminosities typical of the BOSS sample. The expected redshift distortions 
also imply that the effect of the overdensity is smaller on the line of sight compared to 
the perpendicular direction, and changes sign at separations | vr | > 20/i -1 Mpc owing to the 
induced peculiar velocity gradient. We note that this predicted correction assumes linear 
theory, so it can be altered at small scales. 

The quasar-Lya cross-correlation can also be useful to constrain the redshift errors of 
the quasars. Our results for the mean redshift offset Az indicate that the quasar estimators 
that have been used are systematically too low, except for the one based on Mgll which is 
closest to zero. We have warned, however, that the result for this redshift offset may be 
affected by the quasar radiation effect with finite quasar lifetimes, which can introduce an 
asymmetry depending on the sign of ir. 

Future studies of the quasar-Lya cross-correlation have a promising potential for prob- 
ing both the large-scale distribution of matter around quasars and the characteristics of the 
ionizing emission. The Baryon Acoustic Oscillation peak, detected recently in the Lya au- 
tocorrelation ([54], [59]), can also be detected in the quasar-Lya cross-correlation. The ratio 
of the BAO peak amplitudes in the monopole, £, q( ^° / £,aa° i should be equal to ([35]) 

_b q l + ((3 F + p q )/3 + p F (3 q /5 
S™> b F l + 2 /3 F /3 + /32/5 • l °- > 

Using the values b q = 3.7, fip = 1.1, and bp = 0.16, we find ^ q a° l^aa° — 18, and using 
£aa° - 2 x !0" 5 ([54]), we expect ~ 4 x 10~ 4 . This BAO peak should be detectable 

in the cross-correlation before the end of the BOSS survey (see figure 2). At the same time, 
detailed modeling of the cross-correlation on a broad range of scales will hopefully allow for 
the separation of the overdensity and radiation effects, allowing for a measurement of the 
characteristic anisotropy of quasar emission and of the lifetimes of quasars. 
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